Proceedings of The Intl. Conf. on Information, Engineering, Management and Security 2014 [ICIEMS 2014] 



67 



Content Based Image Retrieval Using 

GLCM & CCM 



B. Santosh Kumar 
Asst. Professors, Dept. of ECE, CJITS, Jangaon, Warangal, AP, India. 

Abstract — The growth of digital image archives is increasing the need for the tools that effecth^^filrcr 
and efficiently search through large amounts of visual data. Towards this goal we propose a tecj^ni^e by 
which the color content and texture features of images is automatically extracted by using cow^t based 
image retrieval (CBIR). CBIR has become one of the most active research areas in the pasti^^^ars. Many 
visual feature representations have been explored. Image CBIR is emerging as an imjjjortaW research area 
with application to digital libraries, data mining, education, medical imaging, 
multimedia databases. The focus is on image processing aspects and in particular^ 
features. Color feature is extracted by HSV. The color feature is represented by^x( 
feature extraction is obtained by using gray-level co-occurrence matrix (G\ 
matrix (CCM). Through the quantification of HSV color space, we combine^colc 
as CCM separately. V 
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l .Introduction 

Content-based image retrieval (CBIR), also known as xroep^by image content (QBIC) and content-based 
visual information retrieval (CBVIR) is the applicatio^^Promputer vision techniques to the image retrieval 
problem, that is, the problem of searching for digit*Aiages in large databases.. 




"Content-based" means that the search wiitaHalyze the actual contents of the image rather than the 
metadata such as keywords, tags, and/oj^S^^tions associated with the image. The term 'content' in this 
context might refer to colors, shapes, tf 4jjj£s, or any other information that can be derived from the image 
itself. CBIR is desirable because me^^^eb based image search engines rely purely on metadata and this 
produces a lot of garbage in the asults: 
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Diagram for content-based image retrieval system 



^Represented using such low level image features as color, texture and shape, 

which are extracted from images automatically. 
Among this level features, color features are the 
most widely used features for image retrieval 
because color is the most intuitive feature and can 
be extracted from images conveniently. However 
image retrieval using color and features often gives 
disappointing results, because in many cases, image 
with similar colors do not have similar content. 
This is due to the global color features computed 
often fails to capture color distributions or textures 
within the image. Several methods have been 
proposed to incorporate special color information 
in attempt to avoid color confusion by machine. 
However, this methods often results in very high 
dimensions of features which drastically slow down 
the retrieval speed of the system. In this paper we 
propose a method combining both color and 
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texture features to improve retrieval performance. We compute both the color and texture features from 
the images and images in the database are indexed using both type of features. During retrieval process, 
given a query image, images in the database are firstly ranked using color features. Then in the second 
process, a number of top ranked images are selected and re-ranked according to their texture features. 
Because the texture features are extracted globally from the image, they are not an accurate description of 
the image in some cases. Therefore, we provide two alternatives to user, one is the retrieval based on color 
features, and the other is retrieval based on combined features. When the retrieval based on color fails, the 
user will use the other alternative which is the combined retrieval. 



2. Color Feature Extraction 



So' 



ue HSV 



Basically there are three properties or three dimensions of color that being hue, saturation tf&y 

fltcd on three 



means Hue, Saturation and Value. It is important to look at because it describes the col 
properties. It can create the full spectrum of colors by editing the HSV values. Th 
Hue. Hue is the other name for the color or the complicated variation in the color. ( 
determined by its dominant wavelength. This Hue is broadly classified into 
primary Hue, Secondary Hue and Tertiary Hue. 




: cHfcaension is the 
uality of color as 
gories. They are 



tke saturation. Saturation just 
ixing the colors or by adding 
the color becomes lighter. Then 
nas^l^fhe color. When the value is zero the 
color space is totally black with the increase in the color there ilyj|o increase in the brightness and shows 



Saturation is the degree or the purity of color. Then the second dimen^if 
gives the intensity to the colors. The saturation and intensity drop 
black to the color. By adding the white to the color in spite of more^i 
finally the third dimension is the Value. The value is the brightna 




the various colors. In accordance with the quantization 
vector for different values of with different weights to 
the following equation: 
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G = Qs*Qv*H+Qv*s+V 

Where Qs is the quantized series of S and (w\tni 
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bfcve, the H, S, V three-dimensional feature 
*e dimensional feature vector and is given by 



e quantized series of V. And now by setting 
Qs = Qv = 3, Then G = 9H+3S+V 
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In this way three component vector of the 
HSV from one dimensional vector, Which 
quantize the whole color space for the 72 
kinds of the main colors. So we can handle 72 
bins of one dimensional histogram. This 
qualification is effective in reducing the 
images by the effect of the light intensity, but 
also reducing the computational time and 
complexity. 

3. Proposed Method 

We are proposing two algorithms for image 
retrieval based on the color histogram and 
wavelet-based color histogram. 



3.1 Color Histogram 

Step-i. Convert RGB 
HSV color space 



color space image into 
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Step-2. Color quantization is carried out using color histogram by assigning 8 level each to hue , saturation 
and value to give a quantized HSV space with 8*8*8=512 histogram bins 

Step-3. The normalized histogram is obtained by dividing the total number of pixels. 

Step-4. Repeat step-i to step-3 on an image in the database. 

Step-5. Calculate the similarity matrix of query image and image present in the database. 
Step-6. Repeat the steps from 4 to 5 for all the images in the database. + 
Step-7. Retrieve the images. ♦ 
3.2 Wavelet-Based Color Histogram (WBCM) 
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Sl«f)2. Decompose each Red, 
fireen and Blue Component 
using Haar wavelet 
transformation at ist level to 
get approximate coefficient 
and vertical, horizontal and 
diagonal detail coefficients. 

Step3. Combine approximate 
coefficient of Red, Green and 
Blue component. 

Step4. Similarly combine the 
horizontal and vertical 
coefficients of Red, Green 
and Blue component. 

Step 5. Assign the weights to 
approximate components to 
horizontal and vertical 
coefficients. 



roximate, horizontal and vertical coefficients into HSV plane. 



Stepy. Colo^j^umtization is carried out by color histogram by assigning 8 level to HSV space with 8*8*8=512 
histogra 



DS^fhe normalized histogram is obtained by dividing with total number of pixels. 
Step9. Repeat stepi to step8 on an image in the database. 

Stepio. Calculate the similarity matrix of query image and image present in database. 
Stepn. Repeat the steps from9 to 10 for all the images in the database. 
Stepi2. Retrieve the images. 
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Block diagram of proposed Wavelet-Based Color Histogram ( WBCH). 



4. Texture Feature Extraction by GLCM 

Gray-co-matrix function can be used to create the 
GLCM (Gray level co-occurance matrix). 
Graycomatrix function calculates how often the 
relationship between the pixel value i occurs with 
respect to the pixel value j. The pixel to its 
immediate right and by default the spatial 
relationship is defined as the pixel of interesty^en 
though the spatial relation between the tw(§^l^els 
is verified. Each element in the GLCJVUis naming 
but the sum of the number of times JjAQne pixel 
value i occurs with relation to thyftj^ value j.in 
the input image. For the full rflvnarl^c range of an 
image the processing requireE^^alculate a GLCM 
is prohibitive. The input ijp^jg^was scaled by the 
gray matrix. By defaukCj^ reduce the intensity 



values from 256 to 8 in Grayscale image graycomatrix use scaling. Using the rfpyievels and the gray limits 
parameters of the graycomatrix function the number of gray levels and ^hj^agng of the intensity values in 
the GLCM can be controlled. The properties about the spatial distri^fi^o/f the Gray level in the texture 
image can be revealed by the Gray level co-occurance matrix. 



The following figure shows how gray co matrix calculates the 
GLCM, element (1, 1) contains the value 1 because there is oi 
horizontally adjacent pixels have the values 1 and 1, respi 
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;) and recording the sums in the correspondin^jele 
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GLCM expresses the texture feature according 
the correlation of the couple pixels Gray level at 
different positions. It quantification ally 
describes the texture features. But here mainly 
four things are considered they are energy, 
contrast, entropy and the inverse difference. 



Energy: 





i ; 

It is a gray scale image texture measure of the 
homogeneity changing reflecting the 



distribution of the image gray-scale uniformity of the image and the texture. 



Contrast: 
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Contrast is the main diagonal near the moment of inertia, Which measures the value of the matrix is 
distributed and images of local changes in the number, reflecting the image clarity and the texture of the 
shadow depth if the contrast is large then the texture is deeper. 

Entropy: 



* »' 

Entropy measures image texture randomness, when the space co- occurrence matrix for all val 
it achieved the minimum value; on the other hand, if the value of co-occurrence matrix is yj 
value is greater. Therefore, the maximum entropy implied by the image gray distributia 



Inverse difference: 



Dp istor 




It measures local changes in image texture number. Its value 
between the different regions of the lack of change and partial 
value at the co ordinate (x , y). 




;e is illustrated that image texture 
evenly. Here p(x, y) is the gray level 




4.1 Texture Feature 
Extraction Based on CCM 

A co-occurance matrix or co- 
occurance distribution is a matrix 
or distribution that is defined over 
an image to be the distribution of 
co-occurring values at a given 
offset. Mathematically a co- 
occurance matrix c is defined over 
an n*m image I, Parameterized by 
an offset. The value of the image 
originally referred to the grayscale 
value of the specified pixel. The 
value could be anything, from a 
binary on/off value to 32-bit color 
and beyond. Note that 32-bit color 
will yield a 2 32 x2 32 co-occurrence 
matrix. Really any matrix or pair of 
matrices can be used to generate a 
co-occurrence matrix, though 
their main applicability has been 
in the measuring of texture in 
images, so the typical definition, as 

above, assumes that the matrix is in fact an image. It is also possible to define the matrix across two 
different images. Such a matrix can then be used for color mapping. The statistic features extracted from 
the CCM are as follows :- 
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Energy E = Y Y [mQ t j)f 

0 D 

Contrast / = Y^YQ-j) 2 m(tj) 
« / 

n n 

Entropy S = -TY ; ) • \og[m(L j)] 
where, if ?w(/,;) = 0, log[w(/.y)) = 0 
Inverse difference H = YY- m(LJ) 
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6. Applications 

1. In the military to find tanks or airstrips. 

2. In urban development to determine the extent of housing sprawl. 

3. In local government to track highway assets. 

7. Conclusion 

We presented a novel approach for Content Based Image Retrieval by combining the color and tenure 
features called Wavelet-Based Color Histogram Image Retrieval (WBCHIR). Similarity between the^mag^s 
is ascertained by means of a distance function. The experimental result shows that the propose^ mStnod 
outperforms the other retrieval methods in terms of Average Precision. Moreover, the computafoftat steps 
are effectively reduced with the use of Wavelet transformation. As a result, there is a^^^itation ally 
increase in the retrieval speed. The whole indexing time for the 1000 image database takes minutes. 
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Experiment Results 
l. Color + GLCM retrieval 
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2. Color + CCM retail 
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